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We respond to the issues discussed by Farmer and Lillo (FL) related to our proposed approach to 
understanding the origin of power-law distributions in stock price fluctuations. First, we extend our 
previous analysis to 1000 US stocks and perform a new estimation of market impact that accounts 
for splitting of large orders and potential autocorrelations in the trade flow. Our new analysis 
shows clearly that price impact and volume are related by a square-root functional form of market 
impact for large volumes, in contrast to the claim of FL that this relationship increases as a power 
law with a smaller exponent. Since large orders are usually executed by splitting into smaller size 
trades, procedures used by FL give a downward bias for this power law exponent. Second, FL 
analyze 3 stocks traded on the London Stock Exchange, and solely on this basis they claim that the 
distribution of transaction volumes do not have a power-law tail for the London Stock Exchange. We 
perform new empirical analysis on transaction data for the 262 largest stocks listed in the London 
Stock Exchange, and find that the distribution of volume decays as a power-law with an exponent 
« 3/2 — in sharp contrast to FL's claim that the distribution of transaction volume does not have a 
power- law tail. Our exponent estimate of « 3/2 is consistent with our previous results from the New 
York and Paris Stock Exchanges. We conclude that the available empirical evidence is consistent 
with our hypothesis on the origin of power-law fluctuations in stock prices. 
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We recently proposed a testable theory for the ori- 
gin of the empirically-observed power-law distributions 
of financial market variables such as stock returns, vol- 
umes, and frequency of trades 0. Our theory explains 
the power-law exponent of the distribution of returns by 
deriving a square-root functional form for market impact 
("square-root law") that relates price impact and order 
size. Our previous empirical analysis gave results that 
support the square- root law of market impact. 

Farmer and Lillo (FL) raise some issues related to 
the empirical validity of the theory proposed in Ref. 0. 
Their discussion is based on the following arguments: 

1. FL claim that the price impact function grows 
slower than a square-root law. Interestingly, FL's 
empirical analysis does find a power-law relation- 
ship for market impact with exponent (3 » 1/2 for 
volumes smaller than a threshold, consistent with 
the square-root form of market impact in our the- 
ory. However, for large volumes FL claims (3 < 0.2 
[for the New York Stock Exchange (NYSE)] and 
(3 w 0.26 [based on analyzing 3 stocks in the Lon- 
don Stock Exchange (LSE)]. FL do not compute 
error-bars for for large volumes, but claim that, 
from a visual comparison, [3 — 1/2 (square-root 
law) is inconsistent with the data. 

2. FL argue that the empirical analysis that we pre- 
sented in support of the square-root functional form 



of market impact Q is "invalidated" by the "long- 
memory nature of order flow." 

3. FL analyze the volume distribution of 3 stocks from 
the London Stock Exchange and claim that the vol- 
ume distribution does not follow a power-law. Con- 
sequently, FL conclude that volume fluctuations do 
not determine the power-law tail of returns. 

We first outline our responses to these criticisms and 
then present our detailed response with results of our new 
analysis. 

1. FL find f3 < 1/2 for large volumes from analyzing 
the average value of return for a trade for a given 
trade size. FL's procedure for estimating price im- 
pact is flawed since large orders are usually exe- 
cuted by splitting into smaller size trades, so the 
procedure used by FL gives a downward bias for the 
power law exponent (3 defined in our theory 0, [j| , 
giving rise to an apparent exponent value (3 1 smaller 
than the correct value (3. In fact FL's procedure 
gives (3 < 1/2 for large volumes — precisely the do- 
main in which we expect the order-splitting effect 
to be dominant — and therefore a downward bias for 

2. Although we present new estimators to address this 
point, we believe FL's argument to be incorrect 
since long-memory in order flow clearly does not 
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imply the same for returns, so FL's criticisms about 
our estimation procedure do not seem relevent. To 
address a potential problem of long memory in or- 
der flow, we draw from a new estimator for mea- 
suring market impact Q and extend our previous 
analysis to the 1000 largest NYSE stocks. Our new 
estimation confirms that the market impact func- 
tion does behave as a square-root function of the 
volume. 

3. We analyze 262 largest stocks listed in the London 
Stock Exchange. Our analysis of volume distribu- 
tion for these 262 stocks shows that the distribution 
of volume decays as a power-law with an exponent 
ps 3/2 in agreement with our previous results for 
the NYSE and the Paris Bourse. In fact our anal- 
ysis of the volume distribution for the same stocks 
analyzed by FL shows a clear power-law behavior 
with exponent pa 3/2 — in contrast to FL's claim. 

Define Si as the price of the stock after trade i, 

Spi ee log Si - log Si-l ( 1 ) 

as the return concomitant to trade i, so the return over 
a fixed time interval At is 

N 

i=i 

where N is the number of trades in At. Let be the 
number of shares traded in trade i so 

N 

Q = £> (3) 

i=l 

is the total volume in interval At. We define the trade 
imbalance 

N 
i=l 

where e, = 1 indicates a buyer-initiated trade and e = — 1 
denotes a seller initiated trade. We denote by V the size 
of a large order, which can be executed in several trades. 
1. Measuring market impact 

Let Ap be the change in price caused by a large order 
of size V', all else remaining the same. Our theoretical 
approachyj derives a power-law functional form for the 
market impact function ||, 

Ap ~ V 13 . (5) 

We hypothesized (3 — 0.5 and supported the hypothesis 
by empirical analysis. 

Our hypothesis Eq. JSJ pertains to Ap, the total impact 
in price of a large order of size V. In practice, as outlined 



in Ref. 1], large orders are executed by splitting into 
orders of smaller size which are observed in the trade 
time series as the trade size c^. The empirical analysis 
of Refs. and Q refer to the relationship local price 
change ~E(Sp\q) and not the price impact E(Ap|V) that 
we are interested in. The true market impact function 
E(Ap|V) is indeed notoriously difficult to measure since 
the information about the unsplit order size is usually 
proprietary and not available, neither in our data nor in 
the data analyzed by FL. 

FL claim that the price impact function grows more 
slowly than a square-root function for large volumes. The 
basis for their claim is the analysis presented in Ref. 
that E(dp\q) ~ q with f3 = 0.5 for small q and f3 = 0.2 
for larger q. While ~E(Sp\q) indeed grows less rapidly 
than a square-root, as reported in Ref. 01, ~E(5p\q) nei- 
ther quantifies price impact of large trades, nor does it 
contradict our theory and empirical results 0. This is 
because a trade by trade analysis of ~E(8p\q) leads to a 
biased measurement of full price impact and the expo- 
nent (3, since it does not take into account the splitting 
of trades [10. 

Consider an example. Suppose that a large fund wants 
to buy a large number V of shares of a stock whose price 
is $100. The fund's dealer may offer this large volume 
for a price of $101. Before this transaction, however, the 
dealer must buy the shares. The dealer will often do that 
progressively in many steps, say 10 in this example. In 
the first step, the dealer will buy V/10 shares, and the 
price will go say, from $100 to $100.1, and in the second 
the price will go from $100.1 to $100.2. After some time 
elapses, the price will have gone to $101 in increments of 
$0.1. At this stage, the dealer has his required number 
of shares, and hands them over to the fund manager at 
a price of $101. The true price impact here is 1%, since 
the price has gone from $100 to $101. But in any given 
transaction, the price has moved by no more than $0.1. 
So Ref. 0] would find an "apparent" price impact of no 
more than $0.1, i.e. 0.1% of the price. Since as the 
transaction is executed the price of the stocks goes from 
$100 to $101, the true price impact is 1%. As a result 
the procedure of FL will measure a value 10 times smaller 
than the true value. This downward bias explains why 
FL find in Fig. 2 a maximum impact of 0.1% — a very 
small price impact. Other evidence in economics |llLll2l] 
finds impacts that are up to 40 times larger than that 
of FL's analysis. Likewise our evidence pertains to large 
impacts, captured by Fig. 2 of our paper which shows 
on the vertical axis values of r 2 equal up to 200 times 
the variance a 2 , i.e., values of return r up to v200 pa 14 
standard deviations. 

We can quantify the bias in the above example. Sup- 
pose that a trade of size V is split into K = V a (10 
in our example) trades of equal size q = V/ K = V 1 ^ , 
with < a < 1. Then the apparent impact bp incurred 
by each trade (0.1 % in our example) will be X/K (1/10 
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in our example) of the total price impact V 13 (1% in our 
example), i.e. Sp = V p /K = V p ~ a . So a power law fit 
of Sp vs q, such as the one presented in Fig. 2 of FL, will 
give Sp ' 



q@ with 



= [fi - a) I (1 - a) < [3. 

The "trade by trade" measurement of the price impact, 
as performed by Ref. 0,0, leads to a biased measurement 
j3' of the exponent (3 of the true price impact. 

It is to address this bias that we examine ~E(r 2 \Q) in 
Ref. 0] . As is well established empirically, the sign of re- 
turns is unpredictable in the short term, so the reasoning 
in Ref. 0| shows that E(r 2 |Q) will not be biased Q. 

Our analysis lj was presented with data for the 116 
most actively traded stocks. To check if the result of 
(3 = 0.2 for large volumes presented in Refs. Q and 3 
could arise from increasing the size of the database, we 
now extend our analysis to the 1000 largest stocks in our 
database for the 2-yr period 1994-95. Figure^a) confirms 
that E(r 2 |Q) ~ Q as predicted by our theory. 
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FIG. 1: (a) Conditional expectation function E(r 2 |Q) of the 
squared return for a given volume for At — 15 min for 1000 
largest stocks in the NYSE, Amex and NASDAQ for the pe- 
riod 1994-95. We have normalized r for each stock to zero 
mean and unit variance, and Q is normalized by its first cen- 
tered moment. This normalization procedure allows for a data 
collapse for different stocks and the plot represents an average 
for 1000 stocks. Regressions in the range 3 < Q < 70 give 
values of the exponent (3 — 1.05±0.08. (b) Conditional expec- 
tation function E (r 2 |g max ) of the squared return for a given 
5„ lax for At = 15 min. Here g max is the largest trade size in 
thel5 min interval. Power-law regression gives the value of the 
exponent 2/9 = 1.09±0.06 consistent with /3 — 0.5. (c) Condi- 
tional expectation function E (r 2 |g max ) of the squared return 
for a given g max for fixed number of trades N = 40. Power-law 
regression gives the value of the exponent 2/3 = 1.10 ± 0.06. 
Source: Ref. Hi. 



2. Robustness of estimation against long- 
memory of order flow 

FL argue that the empirical analysis that we presented 
in support of the square-root functional form of market 
impact is "invalidated" by the "long-memory nature of 
order flow". FL's argument is based entirely on the as- 
sumption that returns due to each transaction i can be 
written as rj = e^gf where e; = 1 for a buy trade and 
e. L = — 1 for a sell trade. Under this assumption, FL then 
argue that our estimator E(r 2 |Q) is affected by the long- 
range correlations in the trade signs 0, 0] . FL give 
some numerical evidence for this potential effect for small 
to moderate volumes. 

All of the tests shown by FL are for a fictitious return 
fi constructed on a trade- by-trade basis as fi — eiqf '. 
FL's argument and the tests shown in Fig. 1 of FL are 
for the fictitious return. In reality, return can certainly 
not be expressed as = eigf , with ej being the trade 
indicator. Indeed, if this were true, returns themselves 
would be long-range correlated — a possibility long known 
to be at odds with empirical data. Since the sign of 
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the return r% and that of the trade sign 6j are clearly 
not equal, FL's argument about our estimation procedure 
being affected by the long-memory nature of the trade 
sign (a) is incorrect (See Ref. 0] on a related point). 

Although FL's argument is incorrect, to address the 
more general concern that the autocorrelations of the 
trade signs e» might bias our analysis, we draw from a 
forthcoming paper Q , which performs the following anal- 
ysis • For each intervl At define g max as the size of the 
largest trade. In our theory, if the largest trade V max is 
large, it will have a the major influence on the value of re- 
turn, so that one will have r 2 ~ Knax ~ 9max- Hence g max 
gives us a diagnostic value of the behavior of the largest 
trade, independently of a potential collective behavior. 
We detail this in Ref. Q. We compute E(r 2 |g max ) and 
find [Fig. 

E(r 2 |g max ) ~ <7 max . (6) 



In addition to the above, to ensure that our estima- 
tion is robust to varying number of trades in a fixed At, 
we have computed E(r 2 |g max ) for fixed number of trades 
instead. Figure J^c) shows that E(r 2 |(7 max ) ~ g max for r 
over TV = 40 trades. 

We would like to emphasize that in Fig. D we consider 
very large trades that are up to 70 times the first moment 
of volume. They correspond to returns of up to 14 stan- 
dard deviations of returns. This confirms that we study 
very large trades and returns — the ones that are relevant 
for the study of power law fluctuations, while in contrast 
FL's analysis does not systematically treat large trades. 

We conclude that the procedure used in Ref. Q has 
a downward bias of the price impact f3 of large trades. 
When we perform more appropriate analysis, we confirm 
that the (3 w 1/2. This corroborates our hypothesis 0,0 
that large fluctuations in volume cause large fluctuations 
of prices. 

3. Half-cubic power-law distribution of volumes 

The last claim of FL pertains to the very nature of the 
volume distribution. They present the results of their 
analysis of three stocks in the London Stock Exchange 
and claim their analysis shows no evidence for a power- 
law distribution. 

We analyze the same database which records all trades 
for all stocks listed in the London Stock Exchange. From 
this database, we first examine one stock - Vodafone, 
VOD — which is analyzed by FL. For this stock, we 
compute the volume distribution and find clear evidence 
for a power-law decay [Fig. Ufa)] 

P(q) ~ q-^- 1 (7) 

with exponent £ ? = 1.5 ± 0.1, in agreement with our 
previous results for the NYSE and the Paris Bourse 0, 
Il3j . but in sharp contrast to the FL results who claim a 
thin-tailed distribution for the same data. 
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FIG. 2: (a) Probability density function of trade volumes for 
Vodafone Inc. (VOD) for 2001. A power-law fit in the region 
10 < q < 1000 gives value of the exponent ( q = 1.5 ± 0.1. 
In contrast FL finds much more rapid decay, (b) Probability 
density function of trade volumes for 10 largest stocks listed in 
the London Stock Exchange, showing clear evidence of power- 
law decay with exponent £ 9 = 1.5 ± 0.1, consistent with our 



previous results 13] for the New York Stock Exchange and for 
the Paris Bourse. Here q are normalized by its first centered 
moment, so all 10 distributions collapse on one curve. We 
find an average exponent ( q =1.59 ±0.09. (c) Same as (b) 
for all 265 stocks in our sample where q is normalized for 
each stock by its first centered moment. We find ( q = 1.4 ± 
0.09. (d) Probability density function of trade volumes for 30 
largest stocks listed in the Paris Bourse obtained by the same 
procedure. We find C 9 = 1-49 ± 0.03 @. 



Zumbach for helpful conversations and help with obtain- 
ing the data analyzed. 

Note added in press: After our initial submission, 
it has become clear that FL's claim of a non-power-law 
distribution of trade sizes [2] for the LSE stocks is based 
on incomplete data. FL's analysis excludes the upstairs 
market jig trades which contain the largest trades in the 
LSE. In contrast, our result of a 3/2 power-law exponent 
for the volume distribution is based on data containing 
all trades (both the upstairs and the downstairs market 
trades) in the LSE. By excluding the large trades in the 
upstairs market, FL set an artificial truncation at large 
volume, so FL's finding of a non-power-law distribution 
of volume is merely a trivial artifact of incomplete data. 
Although FL claim in their note added that "it has been 
shown that large price fluctuations in the NYSE (includ- 
ing the upstairs market) and the electronic portion of the 
LSE are driven by fluctuations in liquidity" their new 
analysis and findings are tainted by the same problems 
as in their present comment: (i) incompleteness (absence 
of the upstairs market trades) of the data analyzed and 
(ii) they do not take into account the splitting of large 
orders. 

Gabaix et al. 0,0 and FL 2] discuss two distinct pos- 
sibilities respectively: (i) large price changes arise from 
large trades and (ii) lar ge p rice changes arise from fluc- 
tuations in liquidity |l4lll7j . While we believe that both 
mechanisms play a role in determining the statistics of 
price changes, our empirical findings support the pos- 
sibility that the specific power-law form of the return 
distribution arises from large trades. 



For the 10 largest stocks in our sample, Fig. Efb) 
shows that P(q) is consistent with the same power-law 
of £ 9 = 3/2 — consistent with our earlier finding for the 
NYSE 13]. We extend our analysis to the 265 largest 
LSE stocks and find similar results [Fig.|3fc)]. 

To test the universal nature of this distribution, we an- 
alyze data for 30 largest stocks listed in the Paris Bourse 
and find that P(q) is consistent with the a power-law 
with almost identical exponents ( q = 3/2 [Fig.E^d)]. 

In summary, the analysis of Ref. Q pertains to small 
to moderate trades. FL's estimation is biased for large 
trades, so FL can detect only very small price impacts, 
less than 0.1%. When we use our more general proce- 
dure and study significantly larger data, we confirm our 
initial finding of a square root price impact function. We 
conclude that the available evidence is consistent with 
our hypothesis 0, that large fluctuations the volume 
traded by large market participants may contribute sig- 
nificantly to the large fluctuations in stock prices 0. 
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